Action Recognition with Shot Boundary Detection and Decoded iDT Features
نویسندگان
چکیده
We report our results on the THUMOS Challenge 2014 Action Recognition Task. Given an untrimmed video as input, our method recognizes its major action category in four main steps: (1) detect shot boundaries within the video; (2) decode pre-computed indices of iDTFs within each shot; (3) encode the recovered iDTFs into shot-wise fisher vectors and compute the shot-to-category classification scores; (4) summarize shot-wise classification scores into a video-wise classification score. We report results on four different strategies that are used for summarizing the video-wise classification scores.
منابع مشابه
Ridiculously Fast Shot Boundary Detection with Fully Convolutional Neural Networks
Shot boundary detection (SBD) is an important component of many video analysis tasks, such as action recognition, video indexing, summarization and editing. Previous work typically used a combination of low-level features like color histograms, in conjunction with simple models such as SVMs. Instead, we propose to learn shot detection end-to-end, from pixels to final shot boundaries. For traini...
متن کاملAdvanced and Adaptive Shot Boundary Detection
In this paper, we describe a new approach of shot boundary detection in digital videos which is able to detect not only hard cuts but special effects like wipes, fades, and dissolves as well. Our approach is divided into two steps – feature extraction and shot boundary detection. First, three different features for the measurement of shot boundaries within the video are extracted. Second, detec...
متن کاملCUHK&SIAT Submission for THUMOS15 Action Recognition Challenge
This paper presents the method of our submission for THUMOS15 action recognition challenge. We propose a new action recognition system by exploiting very deep twostream ConvNets and Fisher vector representation of iDT features. Specifically, we utilize those successful very deep architectures in images such as GoogLeNet and VGGNet to design the two-stream ConvNets. From our experiments, we see ...
متن کاملAction Recognition and Detection by Combining Motion and Appearance Features
We present an action recognition and detection system from temporally untrimmed videos by combining motion and appearance features. Motion and appearance are two kinds of complementary cues for human action understanding from video. For motion features, we adopt the Fisher vector representation with improved dense trajectories due to its rich descriptive capacity. For appearance feature, we cho...
متن کاملUniversity of Marburg at TRECVID 2006: Shot Boundary Detection and Rushes Task Results
In this paper, we summarize our results for the shot boundary detection task and the rushes task at TRECVID 2006. The shot boundary detection approach which was evaluated last year at TRECVID 2005 served as a basis for our experiments this year and was modified in several ways. First, we investigated different parameter settings for the unsupervised approach. Second, we experimented with the po...
متن کامل